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Full text available: l g| pdf(518.63 KB) Additional Information: full citation, abstract, r eferenc es, index terms 

Host commercial database systems do (or should) exploit many sorting techniques that 
are publicly known, but not readily available in the research literature. These techniques 
improve both sort performance on modern computer systems and the ability to adapt 
gracefully to resource fluctuations in multiuser operations. This survey collects many of 
these techniques for easy reference by students, researchers, and product developers. It 
covers in-memory sorting, disk-based external sorting, and cons ... 
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John Yiannis, Justin Zobel 
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Volume 16 Issue 2 

Publisher: Springer-Verlag New York, Inc. 

Full text available: 'gjpdf^S.^ KB) Additional Information: full ci tatio n, abstract 

External sorting of large files of records involves use of disk space to store temporary files, 
processing time for sorting, and transfer time between CPU, cache, memory, and disk. 
Compression can reduce disk and transfer costs, and, in the case of external sorts, cut 
merge costs by reducing the number of runs. It is therefore plausible that overall costs of 
external sorting could be reduced through use of compression. In this paper, we propose 
new compression techniques for data consisting of se ... 

Keywords: External sorting, Query evaluation, Semi-static compression, Sorting 
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This paper describes the design, implementation, and evaluation of a Federated Array of 
Bricks (FAB), a distributed disk array that provides the reliability of traditional enterprise 
arrays with lower cost and better scalability. FAB is built from a collection of bricks, small 
storage appliances containing commodity disks, CPU, NVRAM, and network interface cards. 
FAB deploys a new majority-voting-based algorithm to replicate or erasure-code logical 
blocks across bricks and a reconfigurati ... 

Keywords: consensus, disk array, erasure coding, replication, storage, voting 
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Antiquity is a wide-area distributed storage system designed to provide a simple storage 
service for applications like file systems and back-up. The design assumes that all servers 
eventually fail and attempts to maintain data despite those failures. Antiquity uses a 
secure log to maintain data integrity, replicates each log on multiple servers for durability, 
and uses dynamic Byzantine fault-tolerant quorum protocols to ensure consistency among 
replicas. We present Antiquity's design and an ... 

Keywords: archival storage systems, data durability, data integrity, distributed storage 
system, wide-area 
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Publisher: ACM Press 
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" ~ ~ terms , review 

This article presents the deign and implementation of the recovery scheme in Calypso. 
Calypso is a cluster-optimized, distributed file system for UNIX clusters. As in Sprite and 
AFS, Calypso servers are stateful and scale well to a large number of clients. The recovery 
scheme in Calypso is nondisruptive, meaning that open files remain open, client modified 
data are saved, and in-flight operations are properly handled across server recover. The 
scheme uses distributed state amount the client ... 

Keywords: Calypso, cluster systems, distributed state, state reconstruction 
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Publisher: ACM Press 

Full text available: g|pdf(946.00 KB) Additlonal Information: full citation , abstract, references, citings, index 

~~ terms 

We consider an architecture for a serverless distributed file system that does not assume 
mutual trust among the client computers. The system provides security, availability, and 
reliability by distributing multiple encrypted replicas of each file among the client 
machines. To assess the feasibility of deploying this system on an existing desktop 
infrastructure, we measure and analyze a large set of client machines in a commercial 
environment. In particular, we measure and report results on ... 

Keywords: analytical modeling, availability, feasibility analysis, personal computer usage 
data, reliability, security, serverless distributed file system architecture, trust, workload 
characterization 
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